GMM and ARVM cooperation and competition for text-independent speaker recognition on telephone speech
نویسندگان
چکیده
We develop a cooperation and a competition of two different natures modelizations. The first one, the GMM [1], is a modelization of the parametrisation distribution of the speaker speech. The second, the ARVM [2, 3], is a modelization of the speaker speech spectral evolution. To allow cooperation and competition between different modelizations we use a classical measure normalization. We investigate the cooperation/competition of the GMM and ARVM on two levels : global and analytic. In order to improve the performances, we used results of previous study [4] and repeat the experiments on selected phonetic segments.
منابع مشابه
Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch
We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identi cation (clo...
متن کاملComparison of text - independent speaker recognition methodson telephone speech with
We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identiication (clo...
متن کاملText-constrained speaker recognition on a text-independent task
We present an approach to speaker recognition in the textindependent domain of conversational telephone speech using a text-constrained system designed to employ select highfrequency keywords in the speech stream. The system uses speaker word models generated via Hidden Markov Models (HMMs) — a departure from the traditional Gaussian Mixture Model (GMM) approach dominant in text-independent wor...
متن کاملAn Evaluation of DTW, AA and ARVM for Fixed-Text Speaker Identification
Three different methodologies for automatic speaker identification have been evaluated in the paper, namely the well known Dynamic Time Warping (DTW), the Auto-Regressive Vector Models (ARVM) and an Algebraic Approach (AA). The aim of our study is to examine the effectiveness of these approaches in the fixed-text speaker identification task with short phrases in Bulgarian language collected ove...
متن کاملTelephone based speaker recognition using multiple binary classifier and Gaussian mixture models
The present study evaluates MBCM and GMM solutions for both ASV and ASI problems involving text-independent telephone speech from the King speech database. The MBCM's accuracy is enhanced by selectively removing those classi ers within the model which perform worst (pruning). An unpruned MBCM outperforms a GMM for ASV and speakers taken from within the same dialectic region (San Diego, CA). Onc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996